Quark Excitations Through the Prism of Direct Photon Plus Jet at the LHC 
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The quest to know the structure of matter has resulted in various theoretical speculations wherein 
additional colored fermions are postulated. Arising either as Kaluza-Klein excitations of ordinary 
quarks, or as excited states in scenarios wherein the quarks themselves are composites, or even 
in theories with extended gauge symmetry, the presence of such fermions (q*) can potentially be 
manifested in 7 + jet final states at the LHC. Using unitarized amplitudes and the CMS setup, we 
demonstrate that in the initial phase of LHC operation (with an integrated luminosity of 200 pb~ ) 
one can discover such states for a mass upto 2.0 TeV. The discovery of a q* with a mass as large 
as ~5 TeV can be acheived for an integrated luminosity of ~ 140 fb -1 . We also comment on the 
feasibility of mass determination. 

PACS numbers: 12.60.Rc, 13.40.-f, 13.85.Qk 



I. INTRODUCTION 

The replication of fermion families, while being of pro- 
found significance in our understanding of fundamental 
issues such as CP-violation and baryogenesis, nonethe- 
less is puzzling in its own right. Despite its enormous suc- 
cess in explaining all observed phenomena in the regime 
of particle physics, the Standard Model (SM) has been 
entirely unable to proffer any insight into this aspect. In- 
deed, though the observed mass hierarchies and fermion 
mixings can easily be accommodated, the SM framework, 
by its very structure, is unable to even ask such questions 
of itself. This has led to various speculative ideas seeking 
to explain these ill-understood issues. Prominent among 
these are (i) models with extended (family) symmetry, 
(ii) constructions based on higher dimensional theory 
(with or without a string theory motivation) and (Hi) the 
possibility of quark-lepton compositcness, namely that 
the SM fermions are not elementary at all. 

Many of the ideas discussed in this article would be 
equally applicable — possibly with minor variations — to 
theories belonging to any of these three classes. How- 
ever, for the sake of concreteness, we shall consider the- 
ories of compositcness as the basic template for our dis- 
cussions. Part of the motivation lies in the fact of these 
theories having a more straightforward ultraviolet com- 
pletion and, furthermore, suffering from a fewer number 
of extra channels, thereby reducing possible ambiguities. 

In such theories, the fundamental constituents of mat- 
ter, very often termed preons[l|, are postulated to experi- 
ence an hitherto unknown force on account of an asymp- 
totically free but confining gauge interaction Q, which 
becomes very strong at a characteristic scale A, thereby 
leading to bound states (composites) to be identified 
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with quarks and/or leptons. In many such models 0, Hj], 
though not all, quarks and leptons share at least some 
common constituents. 

Accepting this hypothesis would naturally lead to the 
existence of excited states of fermions at a mass scale 
comparable to the dynamics of the new binding force. 
In the simplest phenomenological models Q , the excited 
fermions are assumed to have both spin and isospin 1/2 
and to have both their left- and right-handed components 
in weak isodoublcts (i.e. they are vector- like) . Similar is 
the case for, say higher-dimensional models wherein the 
known universe is constrained to be on a 4-dimensional 
subspace (a 3-brane) while the SM fields — in particular, 
the fermions — live in all the dimensions. The analogues 
of the excited fermions would be the Kaluza-Klein ex- 
citations with the mass scale being identified with the 
inverse of the compactification scale. 

Given that the "excited states" do suffer the SM gauge 
interactions, these may be produced at high-energy col- 
liders and, subsequently, would decay, radiatively, into an 
ordinary fermion and a gauge boson (photon, W, Z or 
gluon). At an e + e~ collider, charged excited fermions 
could be pair-produced via s-channel (7 and Z) ex- 
changes in collisions, while for excited neutrinos only Z 
exchange contributes. Although t-channcl diagrams are 
also possible (W for v* and 7/Z for e*), such contri- 
butions to the overall pair production cross-section are 
generally much smaller on account of the smallness of 
the cross-couplings Q between the excited state, its SM 
counterpart and a gauge boson. This very same cou- 
pling, on the other hand, may be used to singly produce 
such states (through both s- and t-channel diagrams). 
The four LEP collaborations have used these (and other) 
modes to essentially rule out such excitations almost upto 
the kinematically allowed range @. At the hera, on 
the other hand, both excited leptons and quarks may be 
produced singly through t-channel diagrams and these 
processes have been looked at without any positive re- 
sults 0- 

If quarks and leptons are composite particles made up 
of smaller constituents, phenomenological effects may be 
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observable at the LHC and might even show up at Teva- 
tron once sufficiently large luminosities are accumulated 
and analysed. If the compositeness/excitation scale (A) 
is not too high, then excited quarks can be produced on 
shell, while at energies sufficiently below A such an even- 
tuality would manifest itself as an effective four fermion 
contact interaction invariant under SM guage transfor- 
mations [1, Based on phenomenological studies of 
flavour independent contact interactions for two photon 
production, the lower bound has been estimated to be 
A± > 2.88 (3.24) TeV at 95% CL for an integrated lumi- 
nosity of 100 (200) fb" 1 Q3 at the LHC. 

Both the experiments at the Tevatron, the CDF and 
D0, have searched for excited quarks. The latter are as- 
sumed to couple to the SM particles primarily through 
gauge couplings. Their most visible signature could be 
either pair production or single excited state production 
via quark-gluon fusion, provided the q*qg coupling is suf- 
ficiently large. Enhanced dijet production rate with an 
invariant mass peak above the SM continuum is one of 
the prominent signals, extensively searched for at the 
Tevatron and the D0 collaboration has excluded the 
mass range of 200-720 GeVpj]]. Similarly, the CDF col- 
laboration has excluded a mass range of 80-570 GeV 
[HI, EH based on searches with various final states. In 
a similar vein, the CDF collaboration has put a lower 
bound of A > 2.81 GeV at 95% CL using the qq -> ev 
prcoesspjj whereas the D0 collaboration rules out A < 



2.0 TeV at 95% CL from diject mass peak searches [15 1. 

At the LHC, both the ATLAS and the CMS collabora- 
tions have predicted the sensitivity in the dijet produc- 
tion mode. The ATLAS collaboration has xlaimed that 
the use nof dijet angular distributions would allow con- 
tact interactions to be probed upto A =10 TeV with a 
integrated luminosity of 700 pb^ 1 . The CMS collabora- 
tion, on the other hand, has estimated that A = 6.2 TeV 
can be excluded at 95% CL with a luminosity of 100 pb^ 1 
and that 5a sensitivity could be reached for A =8 TeV 
with just lftV 1 of data[l6j|. Recently, the possibility of 
top quark compositeness has been explored through the 
pp — > titi production process and it has been estimated 
that a 5(j excess can be observed for a new state of 2 TeV 

As an effective tool for the measurement of gluon 
density inside the colliding hadrons and for precision 
test of pQCD predictions, the isolated 7+jet final state 
has been studied with great detail at the Tevatron 
collider pH, [l9L l2(i| and fixed target experiments [2l|. 
Since 7 + jet final state will be one of the key back- 
grounds for the H — > 77 search at the LHC, extensive 
isolation studies addressing all known issues have been 
performed with this process both theoretically and with 
detailed detector simulations. It is also an important 
background for many new physics scenario e.g. Large Ex- 
tra Dimensions [22I [23j], Randall Sundrum Gravitons[24| 
etc. It will, thus, be very interesting to look at 7+jet as 
a probe of excited quarks in view of the unprecedented 
energy scale at LHC and the in-depth knowledge of this 



process gleaned from previous experiments and studies. 

The rest of the paper has been organized as follows. 
In section II we have discussed the effective Lagrangian 
for the theory and new physics contribution. In Sections 
III and IV, we respectively discuss the backgrounds and 
event generation. Section V describes the photon and 
jet algorithms used for the analysis. In section VI we 
discuss the smearings due to detector resolution effects. 
Section VII gives the details of kinematical variables used 
to separate the signal from the background whereas Sec- 
tion VIII deals with isolation study. The significance of 
signal and discovery are discussed in section IX and X re- 
spectively. In section XI we have presented the result of 
the analysis. Systematics is discussed in detail in section 
XII followed by our conclusions and outlook. 



II. NEW PHYSICS CONTRIBUTION TO 7 + Jet 
PRODUCTION 



With our interest lying not in the pair-production of 
such excited states, but rather on their contribution to 
the photon plus single jet rates at a hadronic collider, 
we confine ourselves to examining only the relevant part 
of the Lagrangian, namely the (chromo-)magnctic tran- 
sition between ordinary and excited states. In general, 
these may be parametrized by 
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where the index i runs over the three SM gauge groups, 
viz. SU(3), SU{2) and 17(1) and g h Gf^ v and T? are 
the corresponding gauge couplings, field strength ten- 
sors and generators respectively. The dimensionless con- 
stants bi are, a priori, unknown and presumably of or- 
der unity. With these determining both the production 
rates and the branching into various modes, clearly, the 
phenomenology would depend considerably on their (rel- 
ative) sizes. In this article, we shall make the simplifying 
assumption that the excited states do not couple at all 
to the weak gauge bosons, but do so with the gluons and 
the photon. At first glance, this might seem incompat- 
ible with a SU(2) (g) (7(1) invariant structure. However, 
complicated embeddings could be the answer. More than 
this, since the assumption would not change the results 
qualitatively, it, at least, has the merit of reducing the 
number of possible couplings and hence simplifying the 
analysis. 

A further point needs to be noted here. With the La- 
grangian of eq.([T]) being a higher dimensional operator, 
the cross sections would typically grow with the center 
of mass energy, consequently violating unitarity. This 
is not unexpected in an effective theory as the term in 
eq-CEJ) is only the first term and the loss of unitarity, to 
a given order, is presumably cured once suitable higher 
dimensional operators are included. An equivalent way 
to achieve the same goal is to consider the i>, to be form 



3 



factors rather than constants. To this end, we shall define 
the q*q r y and q*qg vertices to be given by 



a^v P 



q*qgM ■ -j- I 1 , A2 



2 \ ~"3 



(2) 



0> T Q 



where Q denotes the relevant momentum transfer and 
fi ~ 1 arc dimcnsionlcss constants related to 6j of cqn.fl]). 
It can be checked that, for Q 2 = s, unitarity is restored 
as long as the constants ri{ > 1. From now on, eqn.@ 
defines our theory! 1 ! . For the rest of our analysis, we shall 
confine ourselves to a discussion of rij = I. While this 
might seem to be an optimistic choice, it is not quite so. 
In fact, the collider search limits in the literature actually 
correspond to n, = and, thus, our limits would be more 
conservative. 

With the aforementioned Lagrangian, the width of the 
q* is given by 



r(g*) 
r<?+7 

To 
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and can be very large for a heavy q* (see Table IJ). As a 
fat resonance is often difficult to observe, this will turn 
out to have profound consequences. 

TABLE I: F(q*) as a function of M g «(=A) for different cou- 
pling strengths. Both a s and a em are evaluated at M q * . 



Ma 



(TeV) 



T (GeV) 



/l=/3 



1.0 



/i = /s = 0.5 



0.5 
1.0 
2.0 
3.0 
4.0 
5.0 
6.0 
7.0 



34.4 
63.6 
118 
170 
221 
271 
319 
367 



8.61 
15.9 
29.6 
42.6 
55.2 
67.6 
79.8 
91.8 



With the introduction of these (flavour-diagonal) ver- 
tex as in eq.Q, the subprocess qg — > 97 acquires a new 



[1] While a Lagrangian formulation leading to such vertices would 
necessitate a seemingly non-local Lagrangian, this is not unex- 
pected in an effective theory. 




FIG. 1: Production of 7 + Jet final state through excited 
quark mediation (a & c) as well as SM processes (b & d). 



contribution as portrayed in Figflja). Adding this con- 
tribution to the pure QCD one, the ensuing differential 
cross section reads 
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(4) 

with the SM result being recovered in the limit A — > 00. 
The new physics contribution to the differential cross sec- 
tion thus depends on four parameters, namely /i,/3,A 
and the mass of the excited state M q * . For simplicity, we 
assume these to be flavour-independent (within a genera- 
tion, it obviously has to be so). For eq.fl]) to make sense 
as an effective Lagrangian, the masses have to be less 
than A (Ref.[H[ requires that M q , < A/\/2). Note that 
as long as A s, one of /1.3 can always be absorbed in A. 
In our analyses, we would be considering only moderate 
values for these parameters. 

For qq — ► 177, the Feynman diagrams are as in FigOTc- 
d); the differential cross-sections are related to those in 
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cqn.(U]) by crossing symmetry and are given by 
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III. BACKGROUNDS 



The 7+jet final state can be mimicked by many known 
processes of the SM. We consider all the leading contribu- 
tions (both the physics backgrounds as well as the detec- 
tor ones) and broadly categorize these into three classes 
viz., 

• Typc-I: where a photon and a hard jet is produced 
in the hard scattering. 

• Type-II: QCD dijet, where one of the jets fragments 
into a high Et tt° which then decays into a pair of 
overlapping photons and, hence, is registered as a 
single photon. Moreover, in some cases the electro- 
magnetic fraction of a jet can mimic a photon in 
the detector. 

• Type-Ill: Photon + dijet production, where one of 
the jets is either lost or mismeasured. This could 
proceed either from hard processes such as qq — > 
QQli 991 (with all possible interchanges of initial 
and final state partons) or result from W/Z + 7 
production with the heavy bosons decaying into a 
pair of jets. 

While the leading contributions to the Type-I back- 
ground emanates from the SM amplitudes of Fig. [TJ a 
further contribution is displayed in Fig [21 In Figs [3] & HI 
we show the major contributing Fcynman diagrams for 
the Types II & III backgrounds respectively. 

At the LHC, the Type-I background is dominated by 
the Compton process (qg — ► 75), while the other two 
subprocesses, namely anihilation(qq — > 75) and gluon fu- 
sion [gg — > 75) contribute only a small fraction in the 
low transverse momentum (Pt) ranges. For higher Pt, 
the annihilation subproccss can contribute upto about 
~ 23% of the total Type-I background. 

The Type-II background accrues mainly from qg — > qg, 
qq — > qq and gg — > gg. For Pt > 200 GeV (here Pt is the 
Pt of the outgoing partons in center of momentum frame 



FIG. 2: Type-I Background: Additional contribution from 
gluon fusion. Lowest order background emanates from (b) 
and (d) in FigfT] 






FIG. 3: Type-II Background: QCD jet production where a 
jet fakes a photon giving a 7 + Jet final state. 



in a 2 — > 2 hard scattering process) , the total production 
cross section is ~ 10 4 times larger than the Type-I back- 
ground. However, the fact of the probability of a jet fak- 
ing a photon in the detector being ~ 10 -3 — 10~ 4 reduces 
the Type-II background to the same order as the Type-I. 
Moreover, for high transverse momenta the QCD dijet 
falls very rapidly (~ -P^ 4 ), thereby suggesting a simple 
mechanism of suppressing this background 

Although the total Type-Ill background is very small 
compared to the others, for the Pt range under consider- 
ation in this analysis, it turns out to be of the same order 
as gg — > jg. And while nonresonant subprocesses (such 
as the 0(a 2 a) contributions to qq — > qq^f or gg — > qqj) 
can, in principle, be substantial, note that these, in some 
sense are related to the much larger Type-I and Type-II 
backgrounds. Consequently, the former arc typically sup- 
pressed when appropriate phase space cuts are imposed 





FIG. 4: Type-Ill Background: Here W/Z° decays to two jets 
and only one of the jets passes the trigger threshold. 
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TABLE II: Production cross section in different Pt bins for various Standard Model backgrounds with 7 + Jet final state. 



Subprocess 


50-100 GeV 


100-200 GeV 


200-400 GeV 


400-600 GeV 


600-1000 GeV 


1000-1500 GeV 


1500 GeV and above 




(Pb) 


(Pb) 


(Pb) 


(Pb) 


(Pb) 


(Pb) 


(Pb) 


qg -> 79 


7.22 x 10 3 


569 


36.3 


1.53 


2.22 x 10" 1 


1.19 x 10 -2 


7.6 x 10" 4 


qq -> 79 


652 


65.3 


5.56 


3.18 x 10" 1 


5.67 x 10^ 2 


3.76 x 10~ 3 


2.8 x 10~ 4 


99 -» 19 


1.79 


8.6 x 10" 2 


3.1 x 10 -3 


7.04 x 10 -5 


6.32 x 10" 6 


1.75 x 10 -7 


5.8 x 10 -9 


QCD Jet 


1.71 x 10 7 


9.70 x 10 5 


4.44 x 10 4 


1.39 x 10 3 


171 


8.19 


5.34 xlO" 1 


a ^(jj)+7 


5.08 


8.49 x 10" 1 


9.50 x 10~ 2 


6.23 x 10" 3 


1.16 x 10~ 3 


8.48 x 10~ 5 


6.46 x 10" 6 


+ 7 


4.80 


6.93 x 10" 1 


6.19 x 10~ 2 


4.16 x 10~ 3 


7.39 x 10~ 4 


4.67 x 10~ 5 


2.99 x 10~ 6 



a Here the branching fraction is taken into account. 



to reduce the latter. 

In Table-II we show the production cross section for 
all backgrounds in different Pt ranges. 

IV. EVENT GENERATION & CUTS 

The event generation for signal and different back- 
ground processes was done with PYTHiA-v6.325 [2(|. For 
signal event generation the matrix elements of Eq. (j4]) 
and ([5]) were implemented inside the pythia framwork. 
We used the following common parameters and pythia 
switches: 

• Parton Distribution Function(PDFs): CTEQ 
5L (13; 

• Q 2 = s; 

• MultiParton fnteraction(MPI): "ON"; 

• Initial State Radiation(ISR) and Final State Radi- 
ation (FSR): "ON". 

To get enough statistics for both the signal and the 
backgrounds, we divided the whole analysis into three 
phase space regions determined by the value of the Pt 
of the final state 7 and the jet. For this purpose, the 
following Pt (ckin(3) parameter of pythia) criteria were 
used for different mass points of signal: 

• P T > 180 GeV: 1.0-3.0 TeV, 

• P T > 450 GeV: 3.5-4.5 TeV, 

• P T > 950 GeV: 5.0-6.0 TeV. 

A total of 16 mass points, 11 for coupling strength 
fi = fs = 1.0 (with a step size of 0.5 TeV) and 5 for 
fi = = 0.5 were generated. The different backgrounds 
were also generated in various Pt range. No pseudora- 
pidity restriction was applied while generating the events 
as the large Pt cut requirement naturally restricts the 
events to well within \rj\ < 5.0. We must also mention 



here that in the final selection of 7, we have used the 
fiducial volume of the electromagnetic calorimeter of the 
CMS detector i.e. |ry| < 2.5 with 1.444 < \t]\ < 1.566 ex- 
cluded on account of the insensitive region between the 
barrel and the endcaps[28j. The jets were selected up to 
|?7| < 3.0 only, because of poor resolution in the forward 
calorimeter. 

Fig. [U shows the deviation in the total cross-section of 
qg — > j+jet as function of M q - (= A). Clearly, the varia- 
tion is well-approximated by a A~ 2 contribution superim- 
posed upon a constant (the SM value). This is reflective 
of the fact that, for large A, the new physics contribu- 
tion is dominated by the interference term in Eqs. (|4l5p 
rather than the pure A term . Only if we had imposed 
harder cuts on the photons, would the latter term have 
dominated (albeit at the cost of reducing event numbers 
and hence the sensitivity). 
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V. PHOTON AND JET CANDIDATES AT THE 
GENERATOR LEVEL 

Although a mass peak in the signal will appear as an 
excess of events over the continuum SM background, the 
significance for such an observation will depend on the 
size of this continnum. Hence, to enhance the signal 
peak, it is nccceassary to reduce the background as much 
as possible. For the signal under investigation, QCD di- 
jet is the largest background, as it mimics 7+jet final 
state when one of the jet fakes a photon. To estimate 
this background reasonably at the generator level, it is a 
must to have a proper understanding of the reconstruc- 
tion algorithm for a specific design of the detector rather 
than limit ourselves to only partonic level photon and 
jets from the final state in an event. 

Taking this into consideration, we have used a clus- 
tering; algorithm to account for fake photons arising from 
jets [23]. The electromganetic calorimeter (ECAL) of the 
CMS detector is made up of PbWOn crystals and each 
crystal covers 0.0175 x 0.0175 (equivalently, 1°) in the 
A77— A<p space (0 being the azimuthal angle). For photon 
reconstruction, we have used the "hybrid" algorithm [30j 
where we consider only those final state electromagnetic 
particles (i.e., 7, e + and e~ ) in the 77 — <fi space such that 
neither of the distances Ary and A(j> from the seed object 
exceeds 0.09. This extension is equivalent to a 10 x 10 
crystal size in the CMS detector. The seed for such a clus- 
tering must have a minimum Pt of 1 GeV. A photon can- 
didate is either a direct photon or other electromagnetic 
obejets such as ir a — > 77, p° — > 77 etc. The main con- 
tribution of fake photons comes from ir° — > 77^ 81%), 
V ~ ¥ 77(~ 12%) while other sources give only a small 
contribution. A detailed discussion of this reconstruc- 
tion algorithm at the generator level can be found in a 
previous work [3l[ . 

For jet reconstruction, various algorithms have been 
used by different collider experiments. These include the 
MidpointCon e I32J . IterativeCone [HI, HH , and the K t al- 
gorithms [H, l36l. l37j . The K t and MidPoint algorithms 
are used mostly for offline analysis. Since we have used 
the CMS setup in our analysis, we use the IterativeCone 
algorithm to reconstruct jets at the generator level. Be- 
ing much faster, this is commonly used for software based 
triggers. While the first algorithms for the jets at the 
hadron colliders started with simple cones in the A77— A<p 
space [38j], clustering techniques have greatly imp roved 
in sophistication over the last two decades [32|, [35[ . 

For a real detector, the first step in the reconstruction, 
before invoking the jet algorithm, is to apply noise and 
pile- up suppression with a set of cuts on Et- To make 
"perfect detector jets", we used a seed Pt cut on the 
Pr-ordered final state particles and selected only those 
which had a transverse momentum above the required 
minimum! 2 ] f P Tseed >1.0 GeV. Once the seed is se- 
lected, we search around for all the particles in a cone 
of AR < 0.5. The objects inside the cone are used to 
calculate a proto-jet direction and energy using the E- 



scheme(^Pi). The computed direction is then used to 
seed a new proto-jet. The procedure is repeated until 
both the energy and the direction of the putative jet is 
stable between iterations. We quantify this by requiring 
that the energy should change by less than 1% and the 
direction by less than AR = 0.01. When a stable proto- 
jet is found, all objects in the proto-jet are removed from 
the list of input objects and the stable proto-jet is added 
to the list of jets. The whole procedure is repeated until 
the list is bereft of objects with an Et above the seed 
threshold. The cone size and the seed threshold are the 
parameters of the algorithm. 



VI. SMEARING EFFECTS 

While a detailed and full-scale detector simulation is 
beyond the scope of this work, realistic detector effects 
can easily be approximated. To this end, we smear the 
generator level information with ECAL and HCAL reso- 
lutions of the CMS detector. 

For the ECAL resolution function, wc use the form 



6E a a„ 

— a\ 11 

E VE E 



c 



(6) 



where a denotes the stochastic term, a n is the white noise 
term and C is the constant term, with the three contribu- 
tions being added in quadrature. For each of these terms, 
we use values identical to those for the electromagnetic 
calorimeter of the CMS [30j , namely 

C = 0.55% 

J 2.1 x 10~ 3 GcV M < 1.5 

[ 2.45 x 10" 3 GeV 1.5<H<2.5 

f 2.7 x 10- 2 GeV 1/2 \rj\ < 1.5 
a = < 

[ 5.7 x 10~ 2 GeV 1/2 1.5 < |r?| < 2.5 . 

The resolutions for Aij and A0 were taken to be 0.02 and 
0.001 respectively for both the barrel and endcap. 

For the hadronic calorimeter, the resolutions were once 
again assumed to be the same as those for the CMS 
HCAL Hlli3, namely, 



• Barrel: 



65% 



SE_ _ 
~E ~ y/E/ GeV 

• Endcaps: 
SE 83% 



E 



y/E/ GeV 



© 5%, Arj = 0.04, Acj) = 0.02 



© 5%, Ary = 0.03, Acj) = 0.02 



[2] The seed threshold can vary from 0.5 to 2.0 GeV depending on 
the energy of reconstructed jet. 



7 



• Forward region: 

8E _ 100% 
~E ~ sjEj GeV 



5%,Ar ? = 0.04, A0 = 0.04 



TABLE III: Preselection-efficiency and geometrical accep- 
tence for various SM backgrounds and few signal points. 



The four momenta of the photon and jet were recal- 
culated after applying these resolution effects using an 
appropriate Gaussian smeared function. In Fig. [6j we 
show the effect of resolution on the mass peak for a M q * 
of 1 TeV. 

It should be noted that the ATLAS detector at the 
LHC has a better jet energy resolution with the constant 
term being ^2 % [4l| compare to ~5 % in the CMS 
detector. On the other hand, the CMS ECAL has a bet- 
ter resolution than the ATLAS one owing to a smaller 
constant term. However, with the resolving power being 
dominated by the jet energy resolution, ATLAS should 
do somewhat better. In other words, our results corre- 
spond to a conservative choice. 
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FIG. 6: Effect of smearing on the mass peak for an excited 
quark of 1 TeV. 



VII. KINEMATIC AL VARIABLES 



In Fig. [7J we show the kinematical distributions for the 
leading photon and the leading jet for P^' je > 200 GeV 
dataset for the background and the signal for M q * —1 
TeV. For purpose of visual clarity, the distributions for 
Z + 7 and W + 7 backgrounds have been scaled up by 
a factor of 10. The bump in the transverse momentum 
distributions are primarily driven by the on-shcll produc- 
tion of the q* and, therefore, are centred slightly below 
M q * /2. As is evident from Fig. [Jje), an excess in the in- 
variant mass spectrum would be quite prominent for even 
jL.dt =\pb~ 1 . The i-channel contribution has been in- 
cluded and manifests itself in the elongation of the side 



Selection Cut 


Signal 7 + Jet QCD Z + 7 W + 7 

% % % % % 


P^ jet > 200GeV 
P^ 3et > 500GeV 

p7,iet > 1Te y 


48.7 44.2 0.90 38.4 37.1 
[1 TeV] 

40.2 39.8 0.42 50.4 50.6 
[4 TeV] 

47.4 46.0 0.51 58.8 59.9 

\K TV, vl 
10 ±e v j 


W\ < 2-5, W at \ < 3.0, 
|?7 7 | £ [1.4442, 1.5666] 


42.4 38.2 0.81 32.8 33.2 
[1 TeV] 

38.2 37.8 0.40 47.4 48.4 
[4 TeV] 

46.4 45.0 0.50 56.3 58.7 

[5 TeV] 



bands. It may be noted that the QCD dijet background 
is more than 10 times as large as the signal, but falls 
very rapidly with P^ iCt (the mistagging probability has 
already been included). Fig. [JJf ) shows the distribution 
in the subprocess center of mass scattering angle, with 
cos 6* = tanh[(?7 7 - rf et )/2.Q}. Note that major differ- 
ences between the signal and background profile occur 
only for px and invariant mass distributions, whereas the 
other phase space variables are not very sensitive discrim- 
inants. 



Fig. [8] shows similar distributions as in Fig. [Jj but for 
the M q » = 5 TeV point instead. For these distributions 
we have used a P^ jet cut of 1 TeV at the pre-selection 
level. With the Pt spectrum for the photon from QCD 
background falling very rapidly, the signal dominates 
over the background above 2 TeV even without isolation 
cuts. As for the corresponding invariant mass (M 7 -j et ) 
distribution — see Fig. [5£e) — a combination of the large 
natural width and smearing effects results in a broad 
bump rather than a sharp one. Once again, the other 
distributions do not discriminate between the signal and 
background in any forceful manner. While the the slight 
dip in the central rf region for the W + 7 process might 
seem intriguing (especially in the absence of any such 
dip in the Z + 7 distribution) , it is but a straightforward 
reflection of the well-known Radiation- Amplitude- Zero 
(RAZ) present in the former [HEH!]. That the RAZ in 
the angular distributuion is apparent only for the high 
Py je * cutoff case can be understood by realising that 
the rapidity of the photon as measured in the laboratory 
can be related to the rapidity (scattering angle) in the 
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FIG. 7: Kinematic variable distributions after 200 GeV pre-selection cut on Pt (a) P^, distribution (b) P^, et distribution (c) rj 
distribution (d) rj jet distribution (e) M 1 -j e t distribution and (f) | cos6*\ 1 -j e t- The signal corresponds to M q * = 1 TeV. 



partonic subprocess center of mass frame through 
77(7) = h,n (^j +V*(l) 

where the momentum fractions of the incom- 

ing partons. For small s values (hence lower CKIn(3) 
cuts) the parton densities are maximized when the (anti- 
)quark acquire small(large) momentum fractions respec- 
tively. This leads to a considerably large contribution to 
rp from the boost, thereby smearing the original double 
peaked rp distribution into centrally peaked one. On the 
contrary for typically high s (ckin(3)> 1 TeV) values 
the Xi tend to be not too different thereby reducing the 
smearing on this account. 

In Table III we show the pre-selection efficiencies and 



geometrical acceptences for the CMS detector for various 
backgrounds and signal of M q . =1, 4 and 5 TeV against 
the total generated events. 



VIII. ISOLATION VARIABLES 

In a detector, a photon candidate is reconstructed by 
summing the electromagnetic energy deposition in ECAL 
towers in a limited region of space, with the sum being 
required to be above a certain Et threshold. For the 
sake of simplicity this limited region can be visualized as 
a cone in A77 — A(j> space given by AR = \J A(f> 2 + A77 2 , 
and containing most of the energy of the electromegnetic 
object. 
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A jet fragmenting into neutral and charge hadrons with 
a 7T° — ► 77 (with overlapping photons) carrying max- 
imum momentum can also lead to fake single-photon 
candidates. To remove such events, photons are re- 
quired not to have associated charged tracks within a 
cone of size i?; so - This is implemented by requiring that 
the scalar/vector sum of energy /transverse momentum 
within i?j SO should be below a certain threshold. For ex- 
ample, the D0 and the CDF experiments demand that 
the Et due to charged tracks within a cone of AR = 0.4 
around the photon should be less than a certain value. 
In this anlaysis, we closely follow the CMS detector sim- 
ulation studies and consider the following isolation 
variables: 

• the number of tracks (N tr k) above a certain thresh- 
old inside a cone around the photon candidate. 



• the scalar sum of transverse energy (Etsum^) hi~ 
side a cone around the photon. Although, in a 
full detector simulation the Etsum is measured 
separately for ECAL and HCAL, but working at 
the generator level, we combine them into a sin- 
gle variable with all kinds of electromagnetic and 
hadronic objects around the photon being taken 
into account. 



A. Track Isolation 

For the purpose of track isolation, only 'stable' charged 
particles e.g. n^, K^, and P ± were considered. The 
other particles were found to have only a negligible con- 
tribution. Indeed, tt^ alone contribute more than 80% of 
the charged tracks. Fig [5] shows the distribution of num- 



10 



ber of charged tracks (Ntrk) around the leading photon 
within a cone of size AR < 0.35 for a M q * = 1 TeV signal 
as well as for the total background. Since the leading pho- 
ton is the true photon for signal events, most of them are 
associated with zero tracks (N tr k =0) and the distribu- 
tion falls off very rapidly for larger N tr k values. For back- 
ground events though, the distribution peaks at N tr k ~7- 
8 and then falls slowly. The small rise at N tr k = is due 
to the fact that 7 + jet (SM) and W/Z + 7 backgrounds 
have true photons as the leading photon in the event and 
have no tracks around them, while the rising part along 
with the extended tail is mainly contributed by the QCD 
dijet events where the fake photon typically has a large 
number of tracks around itself. In this study, we accept 
a photon to be an isolated one if there is no track with 
minimum transverse momentum (P^min) within a given 
cone around it. It should be noted that comparative dis- 
tributions of signal and total background, as shown in 
Fig El is n °t overly sensitive to moderate changes in the 
Prpmin va l ue (the exact values arc discussed at a later 
stage). 
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FIG. 10: Highest Pt track around leading photon for the sig- 
nal and the QCD background a) M q * =1 TeV b) Af,» =5 
TeV. An isolation cone of size 0.35 has been used. Both dis- 
tributions are normalized to unity for the sake of comparison. 
Note that the background differs between the two panels on 
account of the differing requirements on Pt (vide Sect llVl l. 



FIG. 9: Number of tracks(iV t7 . fe ) for the signal (M q * =1 TeV) 
and the background events around the photon. 

In pp collisions at the LHC, a large number of soft 
tracks (in the range of a few MeVs to a few GeVs) will 
be produced in each event. The main sources of such 
soft tracks arc ISR, FSR, minimum bias and underly- 
ing events. For a direct photon emerging from the hard 
interaction, such soft tracks could actually be in the 
near vicinity of the photon. Labelling such photons as 
non-isolated ones could potentially reduce the signal effi- 
ciency, and many interesting events, such as those in this 
study, could be lost. To prevent such loss, tracks are usu- 
ally required to pass certain minimum selection criteria, 
with a minumum threshold on the transverse momentum 
being a common requirement [2^, 0, Hf| . Adopting this 
strategy, we investigate the dependence of the signal ef- 
ficiency and the signal/background (S/B) ratio on the 
chosen Pt threshold (-P^mm)' varying the latter between 
1-3 GeV. 



To optimize the value of P^mim ^ IS useful to exam- 
ine both the signal and the QCD dijet background in 
terms of the distribution for the highest-Pr track. In Fig 
ITUTa) and (b) respectively, we display this distribution 
for the signal for M q - = 1 TeV and 5 TeV. Accompany- 
ing these, in each case, are the corresponding QCD dijet 
background. For ease of comparision both the distribu- 
tions are nomralizcd to unity. As is evident, any tracks 
accompanying the photon in a signal event tend to have 
a low Pt, whereas for the background events, the distri- 
bution is a very wide one. An indicative value for the 
optimal -Rfmm i s given by the point of intersection of the 
two normalised distributions (signal and background). 
The optimal choice docs depend on the signal profile (de- 
termined, in a large measure by the typical momentum 
transfers), as is evident from the crossover points being 
~4 (6) GeV for signals corresponding to M q - of 1 (5) TeV. 
Thus, characterizing only those tracks, around a photon, 
with a Pt ~ 4 GeV as true tracks (or, in other words, 
accepting photons with accompanying tracks satisfying 
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Pt < Prmin ~ ^ GeV as true photons) would mean that 
a very large fraction of the signal is retained while a sig- 
nificant fraction of the background is rejected. In Fig.QTJ 
we display the consequent interplay between signal ef- 
ficiency and the signal to background ratio (S/B), for 
two different signal points (M q * =1 TeV and 5 TcV). It 
is evident from the distributions that adopting a higher 
threshold would remarkably increase the signal efficiency 
with only a small loss in the S/B ratio. More importantly, 
the track isolation requirement reduces the fake photon 
events with the major effect showing up in the QCD di- 
jet background. As is obvious, the strict requirement of 
Ntrk = in a given cone around the photon reduces only 
a small fraction of the signal whereas the S/B ratio is 
improved considerably. 

To keep the analysis simple, we then dispense with 
a M q - -dependent choice of the threshold, and instead 
demand P^min = 3 GeV and N tr k = irrespective of 
the mass of the q* being looked for. Although a choice 
of P^min = ^ GeV would have led to better results (see 
Fig [9|) , we make a more conservative choice to account 
for the fact that, in a real detector, the tracking efficiency 
is usually less than 100 %. 
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FIG. 11 

for the photons from a) 1 TeV signal b) 5 TeV signal 



For 



a given threshold (Prinm)) the individual points correspond 
to differing values of the number of tracks, Ntrk, allowed in 
a cone starting with tracks (for the rightmost point) and 
increasing in steps of one. 



B. Et Sum Isolation 

We now discuss the next isolation variable, namely 
the scalar sum of transverse energy inside a cone around 
the photon candidate. FigsfT2hfb) show respectively the 
Etsum distribution for the leading photon for M q * = 
f (5) TeV for a cone of size AR — 0.35. Both distributions 
are normalized for an integrated luminosity of 1 pb _1 . It 
is evident that, in either case, a large fraction of sig- 
nal events have Etsum < 5.0 GeV whereas the back- 
ground events generically have Etsum > 5 GeV. For 
Mq* = 5 TeV, the discriminating point is even slightly 
higher. 

As in the previous subsection, we next study the de- 
pendence of signal efficiency and S/B ratio on the choice 
of the Etsum threshold. Fig H3T a) and (b) respectively 
show the signal efficiency vs. S/B ratio for 1 TeV and 5 
TeV signal points. It is evident that, for a given signal 
efficiency, a higher S/B ratio can be attained for larger 
cone sizes. For example, demanding AR <0.35 leads to a 
large signal efficiency (~ 92%) and S/B > 0.88 for either 
choices of M q * . On the other hand, any relaxation be- 
yond Etsum > 5.0 GeV reduces S/B considerably with 
only a very small gain in signal efficiency Several Etsum 
thresholds for different cone sizes were analyzed along 
with track isolation requirements to optimize signal effi- 
ciency along with the S/B ratio. 
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FIG. 12: Etsum for the background and the signal events 
around photons for a) M q * =1 TeV b) M q * =5 TeV Signal. 
Distributions are normalized for J Ldt = lpb -1 . 
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FIG. 13: Signal efficiency vs. S/B ratio for different cone sizes for different choices of the Etsum threshold around the leading 
photon for (a) M q * =1 TeV and (b) M q * — 5 TeV. For each choice of the cone size, individual points correspond to a particular 
choice for the Etsum threshold in that cone, starting with 1 GeV at the rightmost point and going up in steps of 1 GeV. 



TABLE IV: Fraction of events surviving for the signal and various backgrounds for different isolation cuts (after Pt Cut). 



Riso 


N trk 


(GeV) 


f^TSUM 

(GeV) 




S 
(%) 


QCD 

(%) 


7 + Jet 
(%) 


Z + j 
(%) 


W + 7 
(%) 


(S/B) 










M q , 


= 1.0 TeV 












0.30 





1.0 
2.0 
3.0 


5.0 
6.0 
5.0 
6.0 
5.0 
6.0 




73.6 
73.7 
81.2 
81.7 
83.0 
83.7 


0.93 
0.94 
1.12 
1.15 
1.19 
1.25 


72.9 
73.1 
80.6 
81.0 
82.3 
83.1 


75.0 
75.1 
82.9 
83.3 
84.8 
85.6 


71.9 
72.0 
79.4 
79.9 
81.2 
81.9 


0.970 
0.967 
0.951 
0.946 
0.941 
0.930 


0.35 





1.0 
2.0 
.3.0 


5.0 
6.0 
5.0 
6.0 
5.0 
6.0 




69.8 
70.1 
79.0 
79.8 
81.0 
82.2 


0.82 
0.83 
1.01 
1.05 
1.08 
1.14 


69.3 
69.5 
78.4 
79.1 
80.4 
81.6 


71.1 
71.4 
80.5 
81.4 
82.6 
83.9 


68.2 
68.5 
77.2 
78.0 
79.2 
80.4 


0.984 
0.982 
0.967 
0.960 
0.957 
0.947 










M q * 


= 5.0 TeV 












0.30 





1.0 
2.0 
3.0 


5.0 
6.0 
5.0 
6.0 
5.0 
6.0 




82.9 
83.1 
91.1 
91.5 
92.9 
93.7 


1.82 
1.83 
2.11 
2.14 
2.17 
2.22 


83.1 
83.2 
91.1 
91.6 
93.0 
93.7 


83.3 
83.5 
91.5 
91.9 
93.4 
94.2 


81.6 
81.8 
89.5 
89.9 
91.3 
92.1 


0.955 
0.954 
0.950 
0.950 
0.949 
0.947 


0.35 





1.0 
2.0 
3.0 


5.0 
6.0 
5.0 
6.0 
5.0 
6.0 




78.8 
79.0 
88.6 
89.3 
90.6 
91.9 


1.63 
1.64 
1.94 
1.97 
1.99 
2.04 


79.0 
79.2 
88.7 
89.4 
90.7 
91.9 


79.3 
79.5 
89.0 
89.8 
91.1 
92.4 


77.7 
77.9 
87.2 
87.9 
89.2 
90.5 


0.960 
0.960 
0.956 
0.956 
0.955 
0.954 



C. Final Selection Cuts 



In Tabic IV, wc show the efficiencies for signal and 
background for all the isolation variables with differing 
thresholds. Since we aim to observe any excess as a mass 
peak over the SM continuum and, in the early phase of 
the LHC operation, would be able to identify a signal 



only for low masses, it is rather important to have a large 
signal efficiency and S/B ratio for smaller M q * . Hence we 
have used the isolation criteria befitting a 1 TeV signal 
point (note that this also works reasonably for higher 
M 9 *s), and performed this analysis for all the different 
signal points considered in this study. Note that, while 
it is indeed possible to have yet other criteria to select 
different threshold based on real detector simulation, the 
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TABLE V: Number of events 
isolation cuts. 


surviving 


for the M q * 


= A = 


1 TeV signal and the backgrounds for J Ldt 


= lOOpb -1 


for different 


Riso Ntrk 


rytrk 

T'min 

(GeV) 


rpmax 
^TSUM 

(GeV) 


S 


QCD 


7 + Jet 


Z + 7 


W + 7 Tot. Background 


(S+B) 


0.30 


1.0 
3.0 


5.0 
6.0 
5.0 
6.0 


2734 
2740 
3085 
3112 


626.2 
634.4 
803.0 
845.9 


2185 
2190 
2467 
2490 


4.10 
4.11 
4.64 
4.68 


2.97 
2.98 
3.36 
3.39 


2818 
2831 
3278 
3344 


3368 
3381 
3896 
3966 


0.35 


1.0 
3.0 


5.0 
6.0 
5.0 
6.0 


2596 
2604 
3011 
3054 


554.0 
560.8 
727.4 
772.1 


2076 
2083 
2409 
2444 


3.89 
3.91 
4.52 
4.59 


2.82 
2.83 
3.28 
3.32 


2637 
2650 
3144 
3224 


3157 
3172 
3747 
3834 



qualitative differences in the results are small. 

Based on these studies, the final selection cuts applied 
are as follows (the P^, ,jet requirements being determined 
by the range of M g * being investigated, vide Scct llVI) : 

• p^ et > 200 GeV(500 GeV, 1 TeV); 



• |?? 7 | < 2.5 



\rf\ <£ [1.4442,1.5666]; 



J 



• \r]i et \ < 3.0; 

• N trk = for P|T fe > 3.0 GeV within R iso < 0.35; 

• Etsum < 5.0 GeV within R lso < 0.35. 



In Table V, we show the expected number of events for M q * = A = 1 TeV for an integrated luminosity of 100 pb~ 
for various combinations of isolation variables discussed above. 

In Fig. [14] we have shown the invariant mass distribution for both signal+background(S+B) and background(B) 
after the all selection cuts. 




FIG. 14: Invariant mass of 7— jet system for Signal+Bacgkround and Background after all the isolation and kinematical cuts, 
(a) M q * = ITeV (b) M q > = 5TeV. 



IX. SIGNIFICANCE LEVEL 

For reporting a discovery significance, we adopt a fre- 
qucntist Monte-Carlo technique based on a method of 
hypothesis testing originally due to Neyman and Pear- 
son [29|, |4(| H3| ■ The aim is to determine which one of two 
competing hypotheses, the so called null hypothesis (H ) 
and the alternative hypothesis (Hi) is favoured by the 
data. In the present context, the SM only case (back- 



ground) constitutes the null hypothesis (H ) and the 
presence of new physics (i.e. excited quark contribution 
to the final state) alongwith the SM is the alternative 
hypothesis (Hi). Heretofore, Hq and Hi will also be 
referred to as background only (B) and signal plus back- 
ground (S+B) hypotheses. 

In the Neyman-Pearson method one aims to design a 
test which minimizes the probability (3 of erroneously re- 
jecting an alternative hypothesis when it is actually true. 
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Understandably, 1 — /3 is defined as the power of a test 
and the most powerful (MP) test is the one which maxi- 
mizes the power for a given value of the probability a of 
rejecting the null hypothesis as false, when it is true in- 
stead. According to the Neyman Pearson lemma [46| , the 
condition for the MP test is obtained as a condition on 
the log likelihood ratio (LLR) of a given dataset coming 
from the null or the alternative hypothesis. Even when 
a MP test does not exist, the LLR statistic can be used 
for testing between two hypothesis due to its statistically 
desirable properties [48]. One accepts or rejects H based 
on the value of the LLR computed from the data. If the 
value of the LLR falls within a range of values (the criti- 
cal region) which is unlikely to come from Hq then Hq is 
rejected. Now, a as defined above is clearly the probabil- 
ity of the LLR value falling in the critical region when Hq 
is true. Hence, 1 — a is reported as the significance level 
of rejecting Hq (i.e., the SM in our case), or, in other 
words, this is the discovery confidence level. 

In general, a LLR test can be constructed out of one or 
many discriminating quantities (e.g. Pt, angular separa- 
tion etc.). In this analysis, the LLR has been constructed 
out of a single discriminating variable, namely M-y—j e t, 
the invariant mass of the leading 7 and jet. While this is 
obviously the most sensitive discriminant in the case of 
on-shell production, it plays an important role even for 
virtual exchanges 31 1 . 

The likelihood ratio is defined as the ratio of Poisson 
probabilities: 



Q 



P poiss (data\S + B) 
Ppoiss(data\B) 



Ppoiss(data\S + B) = 1 ' + , (7) 



P„. 



^datalB) 



(biY 



ml 
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where Sj + bi are the number of events expected in the 
i th bin of the M 7 _j et histogram according to the S+B 
hypothesis whereas bi correspond to the B hypothesis. 
rii here denote the number of events in the i bin of 
M^-j e t histogram from "data" . All efficiencies arc to be 
folded in Sj, bi and 71%. 

The LLR statistic is then given by the expression, 



- 21nQ = 2 



ribinB 

E 

i=l 



2 J2 rii In (l 



(8) 



erated as a random Poisson fluctuation around the mean 
value of the i th bin of the theoretical M 1 _j et histogram. 
The significance level is defined as, 

a=l-GL B =P{Q<Q obs \B), (9) 

the fraction of experiments in a large ensemble of back- 
ground only experiments which would produce results 
more signal-like than the observed data. By definition, 
a S+B hypothesis is "confirmed" at the 5a (3a) level if 
a<2.8 x 1(T 7 (1.35 x 1(T 3 ) g|. 

In Fig. [15l we show the LLR distribution for B and 
S+B type hypotheses for two different mass points. The 
luminosities have been chosen so as to yield a 5a signif- 
icance for the S+B hypothesis. We have used 10 7 MC 
trials for these distributions. In this figure, 1 — CLb is 
the fraction of MC trials of background type hypothesis 
which falls to the left side of peak value of LLRs+b- 
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FIG. 15: Log likelihood ratio distributions for S+B and B 
type hypotheses for a 5<r— significance for (a) 1.0 TeV (b) 5.0 
TeV q* states. 



The "rfaia" rii in the i bin of the test variable is gen- 



In Fig. [16] we show how the LLR discrimination between two types of hypothesis behave as a function of mass 
window around the M q * value. As visible from the figure, beyond ±3r(g*) the two hypthesis looks similar and hence 
it does not contribute to the significance level. We found similar results for all signal points and for the final selection 
we have used ±3r(g*) as the mass window around 7 — jet invariant mass. 
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X. BACKGROUND SUBTRACTED 
INVARIANT MASS 



In this section, we describe a procedure to estimate 
the number of events under the mass peak in the case of 
a discovery (i.e. if the data supports the S+B hypoth- 
esis). Assuming an excess centred aproximately around 
M 7 _jet = Mo, the first step constitutes fitting the data 
over a M 7 _j Ct range centred around Mq but much wider 
than the region of the excess, the aim being to fit the 
background as well as the sidebands. While in a real 
experiment one would attempt to fit the sidebands from 
data alone, here we use a large MC sample to determine 
the shape of the sidebands and find that an exponential 
describes them well (see Fig. [171 for M = ITeV). To 
generate realistic distributions, we consider (s + b) in 
each bin to be an independent Poisson distributed (and 
integer valued) variable with a mean equalling the theo- 
retically expected number of events. A random fluctation 
was then used to generate the "experimentally observed" 
events in the bin concerned. For a good background 
fit on the (S+B) distribution, an identified excess has 
clearly to be left out. To this end, we leave out the range 
~ [Mo — 3 Tq , Md + 3 To] consonant with the binning algo- 
rithm where To = r(M g « = Mo). For a x 2 minimization 
of the fit, the MINUIT [5(j package was used within the 
ROOT framework The fit in Fig. [TTJa) was done 

for an integrated luminosity of 1 fb _1 although a 5a sig- 
nal significance for M q * = 1 TeV is attainable with only 
10pb _1 of data. Fig \T7\ b) shows the background sub- 
tracted mass distribution for M q * =1 TeV. Here we have 
used a single Gaussian to fit the mass spectrum. 

While an integrated luminosity of 1 fb _1 for new 
physics mass measurements would normally be consid- 
ered meagre when compared to the LHC design parame- 
ters, it is interesting to consider the physics possibilities 
with far lower luminosities. To this end, we present, in 
Fig [HI analogous distributions for only 10pb _1 . While 
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FIG. 17: (a)Background fit on the (S+B) distribution with an 
exponential function for 1.0 TeV q* for an integrated luminos- 
ity of lfb" 1 . (b) The corresponding background subtracted 
invariant mass distribution. 
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FIG. 18: As in Fig ll7l but for an integrated luminosity of 
10pb _1 instead. 



the fit for the background is, 
as in the earlier case, once 
tial fit is accepted, the back; 
still very convincing. In Fij; 
events under the Gaussian 
GeV mass range was found 
uncertainty due to error on 
to be at most 4.9 events. 



understandably, not as good 
the validity of an exponen- 
jround subtracted mass fit is 
5 ITBT b) the number of signal 
fit and within the 800-1200 
to be 30.5 ± 5.5(stat.). The 
fitting parameters are found 
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FIG. 19: Required integrated luminosity as a function of M q * 
for (a) 5<r and (b) 3a significance for two different coupling 
strengths (systematic uncertainties are not included). 



We note in passing that in an actual detector at the 
LHC the mass peak will have a tail on the lower mass 
side due to partial containment of showers and fitting 
this may need a Gaussian modified with a Landau or 
some other asymmetric distribution thereby broadening 
the mass peak somewhat. 

The invariant mass distribution has two components, 
the natural Lorcntzian part for an unstable particle with 
a large width and a Gaussian (or a double-Gaussian) dis- 
tribution due to resolution effects. The combined distri- 
bution is a convolution of the two above. Although the 
combined distribution is thus not a simple one, a single 
Gaussian fits the mass peak reasonably well and hence 
we choose to fit the peak with a simple Gaussian. 



XI. RESULTS 

Fig. [T9T a) and (b) respectively show the luminosity 
needed to achieve 5a (3a) significance for the signal as 
a function of the excited quark mass. We find that the 
result obtained using (—2 InQ) are consistent with those 
from s/y/b test statistic. In estimating the required lu- 
minosity, we have exploited only the mass peak region of 
the signal over the SM background. We have used a mass 
window of ~ ±3r(g*) around M q *. In a previous section 
we have shown that beyond ±3r(q*) the discriminating 
statistic, namely LLR, looks similar for S+B and B. 

We have checked the stability of the results by varying 
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TABLE VI: Cross sections for various M q * values after imposing all kinematical and isolation cuts. 



S.N. 


M q . a 


AM 7 _j 


a(S + B) 


a(B) 




a(S*)" 




Efficiency(S+B) 


Efnciency(B) 


Efficiency (S*) 




(TeV) 


±3r(q*) (GeV) 


(Pb) 


(Pb) 




(Pb) 




(%) 


(%) 


(%) 


1 


1.0 


800-1200 


9.26 


4.92 




4.34 




1.304 


0.699 


60.28 


2 


1.5 


1200-1800 


2.034 


1.33 




0.694 




0.288 


0.190 


46.71 


3 


2.0 


1600-2400 


6.72 x 10" 1 


5.10 x 10" 


i 


1.61 x 10" 


i 


0.095 


0.072 


37.23 


4 


2.5 


2000-3000 


2.54 x 10 _1 


2.10 x 10" 


-l 


4.41 x 10" 


12 


0.036 


0.029 


40.67 


5 


3.0 


2450-3550 


7.85 x 10 -2 


6.44 x 10" 


2 


1.40 x 10" 


2 


0.011 


0.009 


75.95 


6 


3.5 


2900-4150 


1.11 x 10 -2 


6.93 x 10" 


3 


4.17 x 10" 


A 


0.274 


0.172 


24.70 


7 


4.0 


3300-4700 


4.90 x 10 -3 


3.40 x 10" 


3 


1.50 x 10" 


3 


0.121 


0.084 


15.60 


8 


4.5 


3700-5300 


2.20 x 10 -3 


1.57 x 10" 


-3 


6.37 x 10" 


-4 


0.054 


0.039 


11.48 


9 


5.0 


4150-5850 


4.60 x 10 -4 


2.47 x 10" 


4 


2.12 x 10" 


4 


0.628 


0.342 


22.49 


10 


5.5 


4500-6450 


2.17 x 10~ 4 


1.29 x 10" 


4 


8.81 x 10" 


-5 


0.299 


0.179 


14.91 


11 


6.0 


5000-7000 


8.39 x 10" 5 


5.14 x 10" 


5 


3.24 x 10" 


5 


0.115 


0.071 


7.85 



°Here fi = h = 1-0 

'Pure New Physics contribution evaluated by subtracting B from (S + B) 



TABLE VII: As in Table|VT] with additional requirement of centrality (|r? 7 ' jet | < 1.5). 



S.N. 


M q , a 


AM 7 _j 


a{S + B) 


a(B) 




a(S*)" 




Efficiency(S+B) 


Efficiency (B) 


Efficiency(S*) 




(TeV) 


±3L(g*) (GeV) 


(Pb) 


(Pb) 




(Pb) 




(%) 


(%) 


(%) 


1 


1.0 


800-1200 


4.75 


2.14 




2.61 




0.668 


0.304 


36.22 


2 


1.5 


1200-1800 


0.87 


0.41 




0.45 




0.123 


0.059 


30.64 


3 


2.0 


1600-2400 


2.27 x 10 _1 


1.10 x 10" 


i 


1.16 x 10" 


-l 


0.032 


0.015 


26.82 


4 


2.5 


2000-3000 


6.54 x 10" 2 


3.43 x 10" 


2 


3.11 x 10" 


2 


0.009 


0.004 


28.66 


5 


3.0 


2450-3550 


2.21 x 10" 2 


1.27 x 10" 


2 


9.40 x 10" 


-3 


0.003 


0.001 


50.95 


6 


3.5 


2900-4150 


6.67 x 10" 3 


3.20 x 10" 


l .i 


3.47 x 10" 


l A 


0.165 


0.079 


20.55 


7 


4.0 


3300-4700 


2.64 x 10" 3 


1.30 x 10" 


3 


1.34 x 10" 


3 


0.065 


0.032 


13.93 


8 


4.5 


3700-5300 


1.01 x 10" 3 


4.59 x 10" 


4 


5.51 x 10" 


4 


0.025 


0.011 


9.92 


9 


5.0 


4150-5850 


3.99 x 10 -4 


2.00 x 10" 


4 


1.98 x 10" 


4 


0.545 


0.277 


20.98 


10 


5.5 


4500-6450 


1.79 x 10~ 4 


9.78 x 10" 


5 


8.15 x 10" 


-5 


0.246 


0.135 


13.78 


11 


6.0 


5000-7000 


6.51 x 10" 5 


3.44 x 10" 


5 


3.07 x 10" 


5 


0.089 


0.047 


7.43 



"Here h = h = 1-0 

'Pure New Physics contribution evaluated by subtracting B from (S+B) 



the bin width of the invariant mass distribution from 
50 GeV to 20 GeV for both M q * = 1.0 TeV and 2.0 
TeV and find that the luminosity required for 5a sig- 
nificance changes by 20% and 1.1 % respectively For 
M q - = 5.0 TeV, on the other hand, we varied the bin 
width from 50 GeV to 100 GeV and found that the re- 



quired luminosity changes by 2.1 %. Similarly we in- 
creased the number of Monte Carlo trials by a factor of 
10 and found that the required luminosity changes by ~ 
20 %, 0.8 % and 2.1 % respectively for the 1.0, 2.0 and 
5.0 TeV mass points. 



In Fig. [T9T a). corresponding to the /; = 1.0 case, we also demonstrate the effect of restricting the photon and the jet 
to the central region of the calorimeter on the required luminosity. For 5<r significance, the latter reduces by ~ 30% 
upto a mass of 4.5 TeV. For M q - ~ 5.0 TeV, though, the signal events are mostly produced in the central region and 
hence the requirement |?7 7 ' : ' e *| < 1.5 does not affect the final result significantly. It is also shown that within the 
present model of compositenes for M q * ~ 5.5 TeV, a ha significance can not be achieved at a centre of mass energy of 
14 TeV. While this might seem to run counter to previous work, note that, unlike in the earlier efforts, we have used 
unitarized amplitudes and hence our cross sections are naturally smaller than theirs. 

In Table VI, we show the results for cross section and efficiency for S+B and B after all the kinematical and isolation 
selection cuts have been imposed for various M q * values. Tabel VII shows similar information for central events alone 
{\rp' jet \ <1.5). ^ 

XII. SYSTEMATIC UNCERTAINTIES uncertainties for the CMS setup, here we present an es- 

Since we have performed a detailed analysis includ- 
ing a realistic simulation of various detector effects and 
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timation of systematic uncertainties. For this we have 
considered only the dominant contribution both in the 
signal and the backgrounds. For both the signal and 
the 7 + jet background we concentrate on the dominant 
process, viz. qg — > qj. For the QCD dijet background, 
all the available processes in PYTHIA were used for the 
estimation of the uncertainty. We did not account for 
qq — > 7 + jet,W/Z(jj) + 7 and gg — > 7 + jet as they con- 
tribute only a small fraction of the total background and 
systematic unccrtainities in these can safely be neglected. 

• Choice of the parton distributions (PDF): To esti- 
mate the uncertainty in the cross sections due to 
the choice of the PDF, the former were recalcu- 
lated for three additional PDFs, namely CTEQ6L, 
CTEQ6M [12 and MRST2001 Using the 
LHApdf package [54 |. the results for each were 
compared to those for our default choice, namely 
CTEQ5L [27| . While the resultant cross sections 
turned out to be higher for CTEQ6M (it should 
be noted that the CTEQ6M distributions are NLO 
and hence their use with LO calculations is fraught 
with danger) and MRST2001 distributions, for 
CTEQ6L they turned out to be lower for almost 
all the signal points. As can be expected, the un- 
certainty in the cross section increases with M q », 
simply because one starts to sample an ill-explored 
region in the (Q 2 ,x) plane. For CTEQ6M and 
MRST2001, the relative deviation varies between 
2.3-13.0 % and 2.6-14.2 % respectively as M 9 » 
changes from 1 TeV to 6 TeV. For CTEQ6L, the 
variation was found to be within —4.5 to +2.25 
%. These numbers are quite consistent with those 
applicable for the SM 7 + jet process alone, for 
which the corresponding numbers are 5.6-11.0% 
(CTEQ6M) and 6.0-12. 0%(MRS2001). Similarly, 
for the dijet background an uncertatinty of 9-16% 
(CTEQ6M) and 8.7-16.5% (MRST2001) was esti- 
mated. 

We have not only used different PDFs but have also 
evaluated uncertainty due to a given proton PDF 
by varying the errors on the parameters of the PDF 
fit itself. For this, we chose CTEQ6L (with NLO a s 
and LO fit) and its 40 subset PDFs. The uncerainty 
was found to be ~ ± 1% for a 1 TeV q* state and 
-8.29% to +10.93% for a 5 TeV one. For QCD 
di-jet and 7 + Jet background these numbers were 
found to be -9.81% to +13.74% and -8.04% to 
+10.54% respectively. 

• Scale Variation: To estimate the dependence of the 
signal and the background cross-sections on the 
choice of the factorization scale Q (default value 
in our analysis being y/§), they were recalculated 
for two other values of the latter viz Q 2 = P T and 
Q 2 = —i. Both these choices for the scale would 
have resulted in a higher cross-section compared to 
Q 2 = s. The deviation increases with M q * and 



ranges between 2.1-11.3% for Q 2 = —t and 10.6- 
25.0% for Q 2 = P| case. For the QCD di-jet back- 
ground the maximum deviation was found to be 
-39 % while for 7 + jet it was -26 %. Thus, the 
overall significance of the signal remains largely un- 
altered. 

• Higher-order effects: For the background, these 
have been studied in detail both theoretically and 
experimentally. For example, 7 + je£ production in 
the SM has been studied in depth using the NLO 
parton level Monte Carlo program JETPHOX [55], 
|56j. Recently, a comparison of these predictions 
have been done with the Tevatron data[57|. Un- 
fortunately, the Py-dependent shape of the triple 
differential cross section(<i 3 a / dpTpdy 1 dy^*) for dif- 
ferent pseudorapidity ranges is not explained sat- 
isfactorily by the NLO calculation. The reason is 
not hard to fathom. A comparison with data ne- 
cessitates the imposition of isolation cuts. On the 
other hand, the NLO calculations depend crucially 
on the choice of isolation cuts and infrared safety 
needs to be taken care of. This has been discussed 
in Ref-Hl]. Modulo such subtleties, an effective 
and easy way to incorporate higher order effects 
is to include if-factors. For 7 + jet production, 
the /T-factor lies in the range 1.0-1.66 depending 
on the details of jet fragmentation (primarily, to a 
7/71"°) [59f]. While the if-factor for our case is not 
known, in the large M q - limit it is not expected to 
be too different from the SM case. Close to thresh- 
old, the /^-factor is normally expected to be even 
bigger. However, given the attendant theoretical 
complications, we adopt a conservative approach 
and ignore all X-factors in this analysis. 

• Jet energy resolution: To incorporate finite detec- 
tor resolutions, the photon and jet four momenta 
were smeared with a energy resolution as given in 
section VI. For the photon Pt range considered in 
this analysis, we expect the constant term (C) to 
be the dominant source of error and it contributes 
about 0.55%. To estimate the effect of the jet en- 
ergy resolution on this analysis, we redid this anal- 
ysis smearing the four momenta of the jet with an 
energy resolution of 100% for the barrel region and 
150 % for the endcaps and the forward regions. The 
effect was studied for two different mass state, viz. 
1.0 TeV and 5.0 TeV. It was found that such a 
large worsening of the jet energy resolution would 
increase the luminosity required for a 5cr signifi- 
cance by about 30% (1%) for 1.0 TeV (5.0 TeV) 
mass states respectively. However, if we increase 
the number of MC trials by a factor of 10 (to sta- 
blize the peak value of LLR$+b) then these num- 
bers were found to be well within 2%. 

• Uncertainty due to pre-selection: The systematic 
uncertainty due to preselection in the Pt range of 
this study is found to be less than 1%. 
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• Luminosity error: For the CMS experiment, this 
error is expected to be ~ 10% for an integrated lu- 
minosity of 1 fb _1 [60 1 an d ~ 3% for an integrated 
luminosity of 30 fb _1 611 ] . 

• To estimate the combined effect of the aforemen- 
tioned uncertatinties we lowered the signal cross- 
section by 5% for the 5.0 TeV mass point and found 
that the luminosity for 5er significance increases 
by ~ 10.9%. On the other hand, if we increase 
the background cross-section by 5% for the same 
mass point then the required luminosity increases 
by ~ 5.1%. 



XIII. CONCLUSIONS 

To summarise, we have investigated the potential of 
using a direct photon (in association with a single hard 
jet) final state at the LHC to probe possible quark ex- 
citations. Such states arise naturally in a variety of sce- 
narios, ranging from Kaluza-Klein excitations in extra- 
dimensional models to theories wherein the quarks them- 
selves are composed of more fundamental objects (pre- 
ons). And, as far as the concerns of the present analy- 
sis go, even other fundamental quarks (as often appear 
in theories with extended symmetries, gauged or global) 
could lead to similar signals and, hence, be discoverable. 

In any such model, the excited states may couple 
to their SM counterparts only through a generalised 
(chromo-)magnetic transition term in an effective La- 
grangian. Consequently, the presence of such states 
would alter the direct photon cross section, whether 
through an on-shell production (and subsequent decay) 
of the q* or through an off-shell exchange (both s- and 
t-channel). The extent of the deviation depends on both 
the mass M g * and the compositeness/excitation scale A. 
The deviation concentrates in the large pt regime, espe- 
cially for larger M q * and can be quite substantial. 

Two points need to be noted here. The first and 
straightforward one relates to the width of the excited 
state. With T(q*) being typically quite large, a narrow- 
width approximation does not hold and the full matrix 
element needs to be incorporated. The second issue is 



more subtle and is connected to the non-renormalizablc 
nature of the effective Lagrangian. Since a naive use of 
a (chromo-)magnctic dipolc moment vertex leads to a 
cross section constant or even growing with the center of 
mass energy, the amplitude needs to be unitarized. This, 
understandably, leads to a suppression of the cross sec- 
tions, a fact often ignored in experimental analyses, but 
included here. 

Using the photon and jet reconstruction algorithms as 
used for the CMS dectector at the LHC, we have per- 
formed a realistic estimation of the deviation caused by 
the excited quark exchange contribution to the direct 
photon plus a single hard jet rate. We have accounted 
for all major backgrounds to evaluate the limits in the 
A — M q * parameter space. With the imposition of mod- 
erate restrictions on the rapidities (as dictated by the de- 
tector acceptances), but stringent cuts on the transverse 
momenta, the background can be beaten down severely 
without any damaging loss of signal. A most crucial in- 
gredient is the application of reasonably stringent isola- 
tion criteria, as it helps control the orders of magnitude 
larger backgrounds from QCD dijet production with one 
jet faking a photon. The consequent exclusion limits that 
may be reached are very strong. While it may seem that 
these are still not as strong as some quoted in the liter- 
ature, it should be realised that most of the latter have 
worked with a non-unitariscd cross section and hence the 
two cannot be compared directly. 
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